Exploring uplift modeling with high class imbalance

Abstract

Uplift modeling refers to individual-level causal inference. Existing research on the topic ignores one prevalent and important aspect: high class imbalance. For instance, in online environments uplift modeling is used to optimally target ads and discounts, but very few users ever end up clicking an ad or buying. One common approach to deal with imbalance in classification is undersampling the dataset. In this work, we show how undersampling can be extended to uplift modeling. We propose four undersampling methods for uplift modeling. We compare the proposed methods empirically and show when some methods have a tendency to break down. One key observation is that accounting for the imbalance is particularly important for uplift random forests, which explains the poor performance of the model in earlier works. Undersampling is also shown to be crucial for class-variable transformation based models.
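The abstract does not spell out the four proposed methods, so the following is only a minimal sketch of the general idea it builds on, not the authors' approach. It assumes a naive scheme: within each treatment group, keep all positives and a random fraction `keep_rate` of negatives, estimate the positive rate on the reduced data, then map that rate back to the original scale with the standard prior correction for undersampling. All function and parameter names (`undersample_negatives`, `correct_probability`, `uplift_estimate`, `keep_rate`) are hypothetical.

```python
import numpy as np


def undersample_negatives(y, keep_rate, rng):
    """Return sorted indices keeping every positive and a random fraction of negatives."""
    y = np.asarray(y)
    pos = np.flatnonzero(y == 1)
    neg = np.flatnonzero(y == 0)
    n_keep = int(round(keep_rate * neg.size))
    kept_neg = rng.choice(neg, size=n_keep, replace=False)
    return np.sort(np.concatenate([pos, kept_neg]))


def correct_probability(p_sampled, keep_rate):
    """Map a probability estimated on undersampled data back to the original scale.

    If negatives are kept with probability keep_rate, then
    p_sampled = p / (p + keep_rate * (1 - p)); this function inverts that relation.
    """
    p_sampled = np.asarray(p_sampled, dtype=float)
    return keep_rate * p_sampled / (keep_rate * p_sampled - p_sampled + 1.0)


def uplift_estimate(y, t, keep_rate, seed=0):
    """Average uplift: corrected positive rate of treated (t=1) minus control (t=0).

    Assumption for illustration: each group is undersampled separately and the
    positive rate is a plain mean; a real model would fit a classifier here.
    """
    rng = np.random.default_rng(seed)
    y = np.asarray(y)
    t = np.asarray(t)
    rates = {}
    for group in (0, 1):
        y_g = y[t == group]
        idx = undersample_negatives(y_g, keep_rate, rng)
        p_s = y_g[idx].mean()  # inflated rate on the undersampled data
        rates[group] = correct_probability(p_s, keep_rate)
    return rates[1] - rates[0]
```

Without the correction step, the difference between the two inflated rates would overstate the uplift, which is one reason undersampling cannot be carried over from classification to uplift modeling unchanged.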

Similar resources

Pessimistic Uplift Modeling

Uplift modeling is a machine learning technique that aims to model treatment effects heterogeneity. It has been used in business and health sectors to predict the effect of a specific action on a given individual. Despite its advantages, uplift models show high sensitivity to noise and disturbance, which leads to unreliable results. In this paper we show different approaches to address the prob...

Uplift Modeling in Direct Marketing

Marketing campaigns directed to randomly selected customers often generate huge costs and a weak response. Moreover, such campaigns tend to unnecessarily annoy customers and make them less likely to respond to future communications. Precise targeting of marketing actions can potentially result in a greater return on investment. Usually, response models are used to select good targets. They aim ...

Classification with class imbalance problem: A Review

Most existing classification approaches assume the underlying training set is evenly distributed. In class-imbalanced classification, the training set for one class (the majority) far surpasses the training set of the other class (the minority), and the minority class is often the more interesting one. In this paper, we review the issues that come with learning from imbalanced class data sets a...

Dealing with Class Imbalance using Thresholding

We propose thresholding as an approach to deal with class imbalance. We define the concept of thresholding as a process of determining a decision boundary in the presence of a tunable parameter. The threshold is the maximum value of this tunable parameter where the conditions of a certain decision are satisfied. We show that thresholding is applicable not only for linear classifiers but also fo...

Class Imbalance Learning

This report presents the work completed since the thesis proposal and the revised plan for the future PhD study. Two main issues have been discussed so far: diversity analysis of ensemble models in class imbalance learning, exploration of negative correlation learning on imbalanced data. Experimental design and main conclusions are simply described. More details are included in the two papers i...

Journal

Journal title: Data Mining and Knowledge Discovery

Year: 2023

ISSN: 1573-756X, 1384-5810

DOI: https://doi.org/10.1007/s10618-023-00917-9